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Data are summarized in Scheuneinan* s Score x Group x 
Response frequency table in order to investigate item tias. The data 

can a r ise frcm tw o different sampling models-: — — m-ttit-iiiomia^^ ■ — ■ — 

sampling in wl^ich a fixed sample size is used and the responses are 
cross-classified according to score^ group^ and response; and (2) 
product-multinomial sampling in which for each group a fixed sample 
size is used and the responses are cross-classified according to 
score and response: Data for both sampling models were analyzed using 
two loqit moiScls^ i,e. special cases cf log linear models^ and 
results , of the procedure were applied to Scheuneman's data using the 
program ECTA, The item was uniformly tiased as shown by whites 
performing tetter than blacks in all score categories. Using a 
frequency table derived from Table 2 of Perline, Wright^ and ttainer's 
nine-item scale for parole decision data, the linear legit model and 
the Easch Model were found to be equivalent. Consequently^ the 
estimates for the parameters in the leg linear model yield 
unccnditional maximum likelihood estimates for the parameters in the 
Basch Model. (BL) 
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Mental tests usually consist of items or stimuli with discrete response 
categories such as correct and incorrect answers, true or false, strenght 
of agreement in a nuinber of categories and so on* In classical test theory 
the item responses are scored and the item scores are combined in a total 
test score. For exaniple, in achievement tests the items are usually 
scored zero or one and the total score equals the number correctly answered 
items* In latent trait theory, however, the latent scale is construr,ted 
taking account of the discrete nature of the responses* 

Recently general methods for the analysis of multi-diniensional tables 
have been developed (Bishop, Fienberg & Holland, 3975) • It is hypothesized 
that many psychometric models and methods cetn be translated into contingency 
t£tble methods » In this pape r two topics are disc n^nori , Fi r^^f i fhn ^bL^e^aamuii L" 
of item bias from a contingency table and secondi the formulation of the 
Rasch model as a loglinear model* 

Assessing Item Bias 
Scheuneman (1979) defined an unbiased item as one "for which the 
probability of a correct response is the same for all persons of a given 
ability level"* To investigate item bias the data are summarized in a 
Score X Group x Response frequency table* Constructing the table scores 
with small numberof subjects must be combined* As an example consider 
the frequencies in the three-dimensional tablp derived f rom Scheimeman ' s 
Table 1 ; 
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These data can arise from two different sampling models- Firsts multi^ 
nomial sampling: h fixed sample size is used and the responses are cross- 
classified according to Score^ Group^ and Response. Second^ product-multi* 
nomial sampling: pcr each group a fixe^ sample size is used and the responses 
are cross-classified according to Score and Response (Pienberg^ 1977^ section 
2.4). For both sampling model the data can be analyzed with loglinear 
models. 

For dichotomously scored items the Hasponse variable has only two 
categories^ i.e. a correct and an incorrect response. The response ratio 
is the ratio of the frequencies of the correct and the incorrect rosnonsos* 
For example^ the estimated rr'^ponse ratio for Blacks in Score category 10-11 
in Table 1 is 23/25= c92-"For the special case of only two response cate- 
gories loglinear models can be transfor:ued to logit models for the response - 
ratio (Pienberg^ 1977^ section 6.2). In thie; paper logit models are desc:ribr=;d 
to assess item bias and to investigate the nature of the bias. 

A linear loqit model for the response ratio in the Ijth Score 
category U= U2^...^ r) and ^^^^ Group (^f U2^...^ k^) ist 

with constraints; 



^ G. = 0 . (3) 

and F, are the Dopulation frequencies of the ::orrect and incorrect 
-i jl -ijO 

re£Jponse in the ith Score category and jth Group; in msans the natural 
logarithm. The parameter £ is a constant- The parameters £j and can be 
interpreted^similarly to. .tlie-Piain .effect parameters in an analysis^ of variance 
model; S^. is the effect of the i^th Score cat«agory and the effect of the 
^th Group on the response ratio. 
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If this model does not fit the data a parameter for the interaction 

of, the Score categories and the Group is needed* F*or example, if the 
response ratio in the low Score categories is higher for Blacks than 

for Whites whereas the response ratio in the high Score categories is 

lower for Blacks thain for Whites, Score and Group interact , ^® item is 

biased* ^e nature of the bias is rather con^licated. An item for 

which the interaction between Score and Group is necessary to explain the 

response ratios will be called not-*unifonnly biased. 

If the model fits the data a second loglt model is considered: 

with constraint Formula 2* If this model do^^s not fit the data the item 

is also biased. This conclusion is strengthened by a significant difference 

in fit of both models. In this case the bias is uniform; F&r all Score 

categories the difference between the response ratios of Blacks and Whites 

is constant, in this case the item will be called miformly biased. Finally, if 

model Formula 4 fits the data the item is not biased. 

liogit models are special cases of loglinear models. The loglinear model 

corresponding to the logit model Formula 1 isi 

-djm — — 1 U) —2 {2) *-3jtm) — 1 2 (i j ) ^13(im } — 23 [:)m) , (5) 



with constraints 
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iil ^Uil ^ i^lUl ^ B=0 ii3(ml ^ 

— X ^ - 

i-1 -12 (ij ) ' ^1 -12 (ij) ." ill -13(iTD )'' E^O -13(im) " 



k i 

2^1 —23 [jm ) m=0 —23 [jm ) 



From these Formulas follows the linear logit model: 
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C + + G^. <8) 

In the same way setting the parameters H.23(jm)^'^ model Formula S equal 
to zero yields the logit model Formula 4. Consequently, procedures used 
for loglinear models can be applied to logit models. 

For assessing the fit of a model Pe?.rson's chi^square, 

and the likelihood ratio chi-square 
r Jc i 

» 2 .r i:_ f.. Intf, . V f.. ) <10) 

— ij=l ^ 1 m^O i jm ijm i jm 

are used. In these Formulas f, . is the frequency obtained in a sasiple under 
the multinomial or product -multinomial sampling model. The the 
estimate of the model expected frequency. Both statistics are asymptotically 
chi-square distributed. The degrees of freedom for model Formula 4 are 

tk;-l)£and for model Formula 1 (k-l) (r-1) (Fienberg, 1977, p. 37). A 

2 2 
property of the G -statistic is that the difference in C_- values of the nested raocels 

Formula 4 and 1 is asymptotically chi-square distributed with 

tJc-1) r- ) (r-1 ) = (k-1) degrees of freedom. The computations can be done 

with computer programs such as ECTA (Goodman & Fay, 1974) and BMDP3F (Dixon & 

Brown, 1977) . 

The results of the procedure applied to Scheuneman**s data using the 
program eCTA are reported in Table 2. 
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Table 2 

Models Fitted to Scheuneman's Data 



Model 



G Df Critical Chi-square 

5% 1% 



Score and Group Effect 
(Formula 1) 



1^74 ^ 1^72 



7.81 11.34 



Score Effect (Formula 4) 



25.63 23.93 



9.49 13.28 



Difference G 



22.21 1 3.84 6.63 



Model Formula 1 yields a good fit, whereas model Formula 4 yields a poor 

fit. Moreover, the difference of G^-values of both models is significant 

at the one percent level. The obvious conclusion is that the item is un f ormly bicis^^d- 

Table 3 shows that in all score categories Whites perform better than Blacks. 
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Table 3 



Natural Logarithm ijesponse Ratio 


and Parameter Estimates 


model Fortsula 1, Scheuneman's 






Data 








Black 


White 






-.534 


.534 


Score 




* 




13-14 


. — 1.526 


1.992 


2.996 


12 


.708 


1.099 


2.197 


10-11 


-.288 


--.083 


1.314 


1-9 


-1.946 


1.293 


-.S81 



Note. The estimate of the constant C Is .946. 
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The Rasch Model as a Logllnear Model 

In the Rasch model for binary scored items the probability that 
subject i^ gives a response scored one to item is written as (Rasch, 1960) i 



= exp(A. - D.)/{l+ exp(A, - D,)}, 



(11) 



where ft^ represents subject^s ability and item's difficulty. From model 
Formula 11 follov/s that the logarithm of the response ratio is: 



(12) 



Suppose a test is administered to a group of subjects. After removing all 
items that are scored zero or one by all subjects the test consists of n_ 
binary items. Under the Rasch model a subject's score, i.e. the number of one's, 
is the sufficient statistic for his or her ability (Fischer, 1974, p, 203), 
Therefore the estimates of t]ie pciram^iters can bo obtained as if every subject 

with the same score has the same ability. The number of score groups if^ {n_ 1), 
i*e* the scores run from 0 to n^. in general ^ of these score groups do not 
yield any information. This is always the case for the score groups 0 and n 
because all subjects in these groups have obtained a zero, respectively, 
one on all iten^s- Moreover, it is possible that sonie score groups do not 
contain subjects at all. Therefore, onjy (n_ + 1 ^ p) 
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score groups yield relevant infornation* The response variable has only 
two categories, zero or one* The information cam be summarized in a 
n xCn_ + 1 - £)x 2 Item x Score x fesponse table. As an example consider 
the frequencey table derived from Table 2 of Perline, Wright and Wainer's 
{1979} nine-item scale for parole decision data: 

FrequonciGs in Item x Score x Respons^^ Tabic for Parole Data 
(porline, Wright t Wainer, 1979) 



Respon.so (m) 



Itfiin (3) 


1 


0 






Score ( i ) 


Score (i) 







1 


2 


3 


4 


5 


6 


7 


8 


1 


2 


3 


4 


, 5 


6 


7 


0 




5 


0 


3 


4 


15 


11 


11 


10 


0 


15 


44 


57 


69 


71 


75 


50 


39 




1 


0 


2 


9 


20 


27 


24 


28 


40 


15 


45 


52 


64 


55 


62 


32 


7 




8 


0 


2 


5 


10 


25 


55 


51 


47 


15 


45 


56 


74 


57 


31 


9 


0 




7 


0 


9 


24 


34 


42 


50 


49 


46 


15 


38 


37 


50 


40 


36 


11 


1 




4 


0 


3 


11 


44 


60 


82 


60. 


47 


15 


44 


50 


40 


22 


4 


0 


0 




9 


0 


11 


20 


4 3 


56 


78 


56 


47 


15 


36 


41 


41 


26 


8 


4 


0 




2 


4 


24 


37 


54 


56 


66 


54 


47 


11 


23 


24 


30 


26 


20 


6 


0 




3 


0 


10 


32 


57 


69 


83 


58 


47 


15 


37" 


29 


27 


13 


3 


2 


0 




5 


11 


30 


41 


59 


64 


67 


54 


47 


4 


17 


20 


25 


18 


19 


6 


0 
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Comparing the linear lo'J^t model Formula 1 for this table with the 
Rasch model Formula 12 show.=i that the models are equivalent. Consequently^ 
the loglinear model can be u-^'sd for estimating the parameters and the. 
goodness of fit of the RasJc^ rKXielp Using the program ECTA the model Formula 1 
was fitted to Perline, Wrigr.n and VJainer's data* The fit of the model is 
rather poor:. ,The values of Pearson's and likelihood ratio chi-squares 
are, respectively/ 285»54aj">rT 295*56 with 56 degrees of freedom. The 
patraneter estimates from the ECTA program were compared to the BICAL 
program Rasch model estimat^j-^^ reported by Perline, Wright, and Wainer • 
As Figure 1 shows the relation of the BCT^ and BICAL estimates is almost 
perfectly linear* 



Figure 1 

Plot of Estimates Com/^^-tcd frorii BICAL (Perlinc, Wright i Wainer, 
1979) versjus ECTA for r^urole Data 



Subject ;^ility 



Item Difficulty 
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1 
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In the loglinear model the parai^y.ters are estimated usincj the rr-iixliLiuni 
likt-lihood inotliod. The question is whcUter the estii^iates sro unconditional 
(uKl) Or conditional m.^iximun likeDj.hood {CVJ^) estirriates, 

CML estimates are obtai.nod maximizing tlie likeDihood iunntion v;ith 

respect to a parameter vector conditional on the minimi sufficient 

statistics for the re^iaining n^^dcl parameters in the vector (Anderscj^., 1973, 

p. 37). The estimate for the vectoz 0, is a CML estimate with respect to 0^. 

But the estiinates for tlie parameters in 0. are UML cstinjates witi^i respect to 

'\f 1 

each other. 

Using the linear logit model BDrmula 1 implies that the Item x Score 
table is considered to be fixed by the design. The product multinomial sampliiiC 
is the only design in v/hich conditioning on sufficient statistics is used 
(Bishop, Fienberg a Holland, 197S, p. 63). The likelihood function is condi- 
tional on the sufficient statistics for the paranieters of the Item x Scoi'e 
table, which are fixed by the design. However, this function contains all the 
parameters of FormxiXa - The likelihood is maximized simultaneously V7ith 

respect to the parameters £, and G ^ • Consequently the estimates for the 
parameters in the log-linear inodel yield UKL estimate.^ tor the parameters in 
the Rasch model* 
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